Support all negative values in ClassLabel #4511
mariosasko
left a comment
Thanks, LGTM!
* support all negative values in ClassLabel
* support None in cast_storage
|
Thanks for this fix! I'm not sure what the release timeline is, but FYI #4508 is a breaking issue for transformer token classification using Trainer and PyTorch. PyTorch defaults to -100 as the ignored label for negative log loss, so switching labels to -1 leads to index errors using Trainer defaults. As a workaround, I'm using master branch directly ( |
|
The new release |
|
@lhoestq I hope it's OK to ping you here. I've noticed that I am talking about this part in datasets/src/datasets/features/features.py Lines 1126 to 1129 in e71b0b1
|
We usually use -1 to represent a missing label, but we should also support any negative values (some users use -100 for example). This is a regression from datasets 2.3.
Fix #4508
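
For illustration, the intended behavior can be sketched in plain Python. This is not the actual `datasets` code: `check_class_ids` is a hypothetical helper, and the sketch assumes only that ids at or above the number of classes are invalid, while any negative id (-1, -100, ...) or None stands for a missing label and passes through unchanged.

```python
from typing import List, Optional


def check_class_ids(ids: List[Optional[int]], num_classes: int) -> List[Optional[int]]:
    """Hypothetical validator, not the library's implementation.

    Rejects only ids >= num_classes. Negative ids and None are treated
    as "no label" and are returned unchanged.
    """
    for class_id in ids:
        if class_id is not None and class_id >= num_classes:
            raise ValueError(
                f"Class label {class_id} is out of range for {num_classes} classes"
            )
    return ids


# -1 and -100 are both accepted as missing-label markers, as is None.
labels = check_class_ids([0, 2, -1, -100, None], num_classes=3)
print(labels)
```

Checking only the upper bound, rather than requiring ids to be at least -1, is what would let users keep -100, the default ignored index of PyTorch's cross-entropy loss, in their label columns.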